On the combination of auditory and modulation frequency channels for ASR applications
Authors
Abstract
This paper investigates the combination of evidence from different frequency channels obtained by filtering the speech signal at different auditory and modulation frequencies. In our previous work [1], we showed that combining classifiers trained on different ranges of modulation frequencies is more effective when performed in a sequential (hierarchical) fashion. In this work we verify that combining classifiers trained on different ranges of auditory frequencies is more effective when performed in a parallel fashion. Furthermore, we propose a neural-network-based architecture for combining evidence from different auditory-modulation frequency sub-bands that takes advantage of these findings. This reduces the final WER by 6.2% absolute (from 45.8% to 39.6%) with respect to the single-classifier approach in an LVCSR task.
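The difference between the two combination schemes named in the abstract can be illustrated with a minimal sketch. It is not the authors' architecture: the toy linear-softmax classifiers stand in for trained neural networks, and every name, weight, and feature value below is hypothetical. In the parallel scheme each classifier sees only its own sub-band and a merger sees all posteriors at once; in the sequential scheme each stage sees its own sub-band plus the posteriors of the previous stage.

```python
import math


def softmax(scores):
    """Turn raw scores into posteriors that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear_classifier(weights):
    """Toy classifier (hypothetical stand-in for a trained neural network)."""
    def classify(features):
        scores = [sum(w * x for w, x in zip(row, features)) for row in weights]
        return softmax(scores)
    return classify


def combine_parallel(classifiers, band_features, merger):
    """Each classifier sees only its own sub-band; the merger sees all posteriors."""
    posteriors = []
    for clf, feats in zip(classifiers, band_features):
        posteriors.extend(clf(feats))
    return merger(posteriors)


def combine_sequential(classifiers, band_features):
    """Each stage sees its own sub-band plus the previous stage's posteriors."""
    posteriors = []
    for clf, feats in zip(classifiers, band_features):
        posteriors = clf(list(feats) + posteriors)
    return posteriors


# Two classes, two sub-bands with two features each (all values are made up).
bands = [[0.5, -1.0], [1.5, 0.2]]

# Parallel: two band classifiers (2x2 weights) and a merger over 4 posteriors.
band_clfs = [linear_classifier([[1.0, 0.0], [0.0, 1.0]]),
             linear_classifier([[0.5, -0.5], [-0.5, 0.5]])]
merger = linear_classifier([[1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0]])
parallel_out = combine_parallel(band_clfs, bands, merger)

# Sequential: stage 1 sees 2 features, stage 2 sees 2 features + 2 posteriors.
stages = [linear_classifier([[1.0, 0.0], [0.0, 1.0]]),
          linear_classifier([[0.5, -0.5, 1.0, 0.0], [-0.5, 0.5, 0.0, 1.0]])]
sequential_out = combine_sequential(stages, bands)

print(parallel_out, sequential_out)
```

Both functions return one posterior vector over the classes; only the way information flows between the sub-band classifiers differs.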
Similar resources
Using WPT as a New Method Instead of FFT for Improving the Performance of OFDM Modulation
Orthogonal frequency division multiplexing (OFDM) is used to provide immunity against very hostile multipath channels in many modern communication systems. The OFDM technique divides the total available frequency bandwidth into several narrow bands. In conventional OFDM, the FFT algorithm is used to provide orthogonal subcarriers. Intersymbol interference (ISI) and intercarrier interferen...
Evaluation Performance of OFDM Mutlicarrier Modulation over Rayleigh and RicianStandard Channels Using WPT-OFDM Modulations
In recent years, Wavelet Packet Modulation (WPM), also known as Wavelet Packet Transform based Orthogonal Frequency Division Multiplexing (WPT-OFDM), has been introduced to wired and wireless communications as an efficient Multicarrier Modulation (MCM) technique. Wavelets have attractive features such as flexibility, compatibility, and localization in both the time and frequency domains with no need to use ...
O23: Modulation of Pacemaker Channels and Rhythmic Thalamic Activity by Demyelination and Inflammatory Cytokines
The thalamus is a central element for the generation of rhythmic oscillatory activity under physiological and pathophysiological conditions. Especially slow oscillations in the delta and theta frequency band which normally occur during slow-wave sleep are associated with a number of neuropsychiatric conditions if they occur during wakefulness and may be the basis for the generation of character...
An Efficient Hierarchical Modulation based Orthogonal Frequency Division Multiplexing Transmission Scheme for Digital Video Broadcasting
As the number of users grows, efficient use of spectrum plays an important role in digital terrestrial television networks. In digital video broadcasting, local and global content are transmitted by single-frequency and multifrequency networks, respectively. Multifrequency networks support transmission of global content and consume a large amount of spectrum. Similarly local content are well ...
Robust speech recognition using the modulation spectrogram
The performance of present-day automatic speech recognition (ASR) systems is seriously compromised by levels of acoustic interference (such as additive noise and room reverberation) representative of real-world speaking conditions. Studies on the perception of speech by human listeners suggest that recognizer robustness might be improved by focusing on temporal structure in the speech signal th...